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A morphism a is unambiguous with respect to a word a if there is no other morphism T that maps 
a to the same image as a. In the present paper we study the question of whether, for any given 
word, there exists an unambiguous 1-uniform morphism, i. e., a morphism that maps every letter in 
the word to an image of length 1 . 



1 Introduction 

If, for a morphism o : A* — » £* (where A and £ are arbitrary alphabets) and a word a G A*, there exists 
another morphism T mapping a to a (a), then o is called ambiguous with respect to a; if such a T 
does not exist, then o is unambiguous. For example, the morphism Oo : {A,B,C}* — > {a,b}* - given by 
On (A) := a, 00(B) := a, On(C) := b - is ambiguous with respect to the word Ofo := ABCACB, since the 
morphism To - defined by To (A) := e (i. e., To maps A to the empty word), To(fi) := a, Tn(C) := ab - 
satisfies To(ao) = Oo(ao) and, for a symbol X occuring in a, Tq(X) / Oo(X): 

ob (A) aa(B) cto(c) o (A) 00(C) co(B) 
Oo(ao) = fl ^ Q b ^ a & , s f =/ R)(Oo). 

T (B) Tb(C) T (C) T (B) 

It can be verified with moderate effort that, e.g., the morphism Oi : {A,B,C}* — >■ {a,^}* - given by 
Oi(A) := a, G\(B) := Oi(C) := b - is unambiguous with respect to Oo- 

The potential ambiguity of morphisms is relevant to various concepts in the combinatorial theory 
of morphisms, such as pattern languages (see, e.g., Mateescu and Salomaa [9]), equality sets (see, eg., 
Harju and Karhumaki @) and word equations (see, e. g., Choffrut [2]). This relation is best understood 
for inductive inference of pattern languages, where it has been shown that a preimage can be computed 
from some of its morphic images if and only if these images have been generated by morphisms with 
a restricted ambiguity (see, e. g., Reidenbach ifTOll ). Hence, intuitively speaking, unambiguous mor- 
phisms have a desirable, namely structure-preserving, property in such a context, and therefore previous 
literature on the ambiguity of morphisms mainly studies the question of the existence of unambiguous 
morphisms for arbitary words. In the initial paper, Freydenberger, Reidenbach and Schneider [5] show 
that there exists an unambiguous nonerasing morphism with respect to a word a if and only if a is not a 
fixed point of a nontrivial morphism, i. e., there is no morphism <p satisfying 0(a) = a and, for a symbol 
x in a, <p(x) 7^ x. Freydenberger and Reidenbach [4 ] study those sets of words with respect to which so- 
called segmented morphisms are unambiguous, and these results lead to a refinement of the techniques 
used in [5 ]. Schneider [ 13 ] and Reidenbach and Schneider [ 12 ] investigate the existence of unambiguous 
erasing morphisms - i. e., morphisms that may map symbols to the empty word. Finally, Freydenberger, 
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Nevisi and Reidenbach [ 3 ] study a definition of unambiguity that is completely restricted to nonerasing 
morphism^ and they provide a characterisation of those words with respect to which there exist unam- 
biguous morphisms a : A + — > £ + in such a context (this characterisation does not hold for binary target 
alphabets E, though). 

In the present paper, we study the existence of unambiguous 1 -uniform morphisms for arbitrary 
words, i. e., just as our initial example Go, these morphisms map every symbol in the preimage to an 
image of length 1. In order to obtain unrestricted results, we wish to consider words over an unbounded 
alphabet A as morphic preimages. Therefore, we assume A := N; in accordance with the existing liter- 
ature in the field, we call any word a € N* a pattern, and we call any symbol x € N occurring in a a 
variable. Thus, more formally, we wish to investigate the following problem: 

Problem 1. Let a £ N* be a pattern, and let £ be an alphabet. Does there exists a 1-uniform morphism 
O : N* — > £* that is unambiguous with respect to a, i. e., there is no morphism T : N* — > £* satisfying 
t(gc) = o(oc) and, for a variable x occurring in a, t(x) ^ o(x)? 

There are two main reasons why we study this question: Firstly, any insight into the existence of 
unambiguous 1-uniform morphisms improves the construction by Freydenberger et al. 0, which pro- 
vides comprehensive results on the existence of unambiguous nonerasing morphisms, but is based on 
morphisms that are often much more involved than required. This can be illustrated using our above 
example pattern cZq (now interpreted as Ofo := 1 - 2 ■ 3 • 1 ■ 3 - 2 in order to fit with the definition of patterns 
as words over N). Here, the unambiguous morphism <7i - which is not 1-uniform, but still of very limited 
complexity - produces a morphic image of length 8, whereas the unambiguous morphism for Oq defined 
in leads to a morphic image of length 162. This substantial complexity of known unambiguous mor- 
phisms has a severe effect on the runtime of inductive inference procedures for pattern languages, which, 
as mentioned above, are necessarily based on such morphisms. Thus, any insight into the existence of 
uncomplex unambiguous morphisms is not only of intrinsic interest, but is also important from a more 
applied point of view. Secondly, as shown by Gq^OCq), the images under 1-uniform morphisms have a 
structure that is very close to that of their preimages. This is because, whenever the pattern contains 
more different variables than there are letters in the target alphabet, a 1-uniform morphism reduces the 
complexity of the preimage by mapping certain variables to the same image. Thus, such a morphic 
simplification and its potential ambiguity are a very basic phenomenon in the combinatorial theory of 
morphisms. Our studies shall suggest that Problem[T]is nevertheless a challenging question, and we shall 
demonstrate that it is related to a number of other concepts and problems in combinatorics on words. 

Note that, due to space constraints, this extended abstract contains just a few proofs, focussing on 
those that are reasonably short and suitable to illustrate our basic proof techniques. 

2 Definitions and Preliminary Results 

For the definitions of patterns, variables, 1-uniform morphisms, {unambiguous morphisms , fixed points 
of nontrivial morphisms, and the symbol e, Section[T]can be consulted. 

Let A be an alphabet, i. e., an enumerable set of symbols. A word (over A) is a a finite sequence of 
symbols taken from A. The set A* is the set of all words over A, and A + := A* \ {e}. For the concatenation 

'Note that |5. 4| also deal with unambiguous nonerasing morphisms, but they use a stronger notion of unambiguity that 
is based on arbitrary monoid morphisms. Hence, they call a morphism o" unambiguous only if there is no other - erasing or 
nonerasing - morphism t satisfying l(a) = o"(a). In contrast to this, and in contrast to the present paper, |3 | disregards erasing 
morphisms r. Consequently, in the definition of unambiguity studied by [3J, our initial example o"o is considered ("weakly") 
unambiguous with respect to oso, since all morphisms T with t(o!()) = o"o(<*o) are erasing morphisms. 
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of two words Wi,m>2, we write w\ ■ W2 or simply w\W2- The notion \x\ stands for the size of a set x or 
the length of a word x. For any word w G A*, the notation \w\ x stands for the number of occurrences of 
the letter x in w. The symbol [. . .] is used to omit some canonically defined parts of a given word, e. g., 
a = 1 • 2 •[...]• 5 stands for a = 1 • 2 • 3 • 4 • 5. We call a word v G A* a factor of a word w G A* if, for 
some Mi,«2 £ A*, w = U\VU2, moreover, if v is a factor of w then we say that w contains v and denote 
this by v C w or w = • • • v • • • . If v 7^ w, then we say that v is a proper factor of w and denote this by 
v C w. If «i = e, then v is a /?re/uc of w, and if «2 = £, then v is a of w. For every letter x in w, 
L x := {y GA | w = ■■■y-x---}UL' x andR x := {y £ A \ w = ■ ■ ■ x ■ y ■ ■ ■} U R x , where L' x = {e} if w =x- ■■ 
and Z^. = if w ^ x ■ ■ ■ , and R' x = {e} if w = ■ ■ -x and R' x = if w ^ • • • x. We refer to the sets L x and 7?^ 
as neighbourhood sets. 

For alphabets A,B, a mapping h : A* — > B* is a morphism if /j is compatible with the concatenation, 
i. e., for all v, w G A*, /j(v) • = /i(vw). We call B the torggf alphabet of /i. The morphism is said 
to be nonerasing if, for every x£A, ^ e. A morphism is called a renaming if it is injective and 
1 -uniform. We additionally call any word v a renaming of a word w if there is a morphism h that is a 
renaming and satisfies h(w) = v. A word w G A* is said to be in canonical form if it is lexicographically 
minimal (with regard to any fixed order on A) among all its renamings in A*. 

With regard to an arbitrary pattern a G N*, var(a) denotes the set of all variables occurring in a. 
If we say that a pattern is in canonical form, then this shall always refer to the usual order on N, i. e., 
1 <2<3<.... 

The question of whether a pattern a is a fixed point of a nontrivial morphism (which can be decided 
in polynomial time, see Holub [7 ]) is equivalent to a number of other concepts in combinatorics on 
words. More precisely, a is a fixed point of a nontrivial morphism iff a is prolix iff a is morphically 
imprimitive iff there exist a certain characteristic factorisation of a; these equivalences are explained by 
Reidenbach and Schneider ifTTI in more detail. Results on unambiguous morphisms have been stated 
using any of these concepts. In the present paper, our presentation shall focus on the notion of fixed 
points. Therefore, we can now paraphrase a simple yet fundamental insight by Freydenberger et al. Q - 
which implies that an answer to Problem [TJ is trivial for those patterns that are fixed points of nontrivial 
morphisms - as follows: 

Theorem 1 (Freydenberger et al. [ 5 ]). Let a G N* be a fixed point of a nontrivial morphisms, and let E 
be any alphabet. Then every nonerasing morphism o : N* — > £* is ambiguous with respect to Of. 

Hence, we can safely restrict our subsequent considerations to those patterns that are not fixed points. 

3 Fixed Target Alphabets 

In the the present section, we describe a number of conditions on the existence of unambiguous 1 -uniform 
morphisms a : N* — > £* with a. fixed target alphabet £, i. e., the size of £ does not depend on the number 
of variables occurring in a. While the main result by Freydenberger et al. @ demonstrates that the 
set of patterns with an unambiguous nonerasing morphisms is independent of the size of £ (provided 
that |E| > 2), our initial example Oo and all patterns a m := 1 • 1 • 2 • 2 • [. . .] • m ■ m with m > 4 do not 
have an unambiguous 1 -uniform morphism o : N* — > Z* for binary alphabets E. In contrast to this, such 
morphisms can be given for ternary (and, thus, larger) alphabets: 

Theorem 2. Let m G N, m > 4, let £ be an alphabet, and let a m := 1 • 1 • 2 • 2 • [. . .] • m ■ m. There exists a 
1-uniform morphism o : N* — > £* that is unambiguous with respect to OC m if and only if \L\ > 3. 
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Proof. Since squares cannot be avoided over unary and binary alphabets, it can be shown with very 
limited effort that there is no unambiguous 1 -uniform morphism a : N* — > L* with respect to any a m if 
£ does not contain at least three letters. 

According to Thue [14], there exists an infinite square-free word over a ternary alphabet. Let this 
word be w. Thus, 

w = abcacbabcbacabcacbaca ■■■ . 
We define the word w' by repeating every letter of w twice. Consequently, 

w = aabbccaaccbbaabbccbbaaccaabbccaaccbbaaccaa ■ ■ ■ . 

We now define a 1-uniform morphism a : N* — > {a,b,c}* such that o{a m ) is a prefix of w' . Since w is 
square-free, the only square factors of w' are aa, bb and cc. Hence, it can be easily verified that a is 
unambiguous with respect to a m . □ 

Thus - and just as for the equivalent problem on unambiguous erasing morphisms (see Schnei- 
der lfT3l ) - any characteristic condition on the existence of unambiguous 1-uniform morphisms needs to 
incorporate the size of Z, which suggests that such criteria might be involved. Therefore, our results in 
this section are restricted to sufficient conditions on the existence of unambiguous 1-uniform morphisms. 

Our first criterion is based on (un)avoidable patterns and is, thus, related to the above-mentioned 
property of the patterns a m : 

Theorem 3. Let n € N, /3 := n • r 2 •[...] • r m and a := \ n ■ 2 n ■ 3 r2 ■ A n •[...]• n^"/ 2 ^ with n > 2 for 
every i, 1 < i < \n/2]. If [5 is square-free, then there exists a 1-uniform morphism a : N* — > {a,b}* that 
is unambiguous with respect to a. 

Our second criterion again holds for binary (and, thus, all larger) alphabets £. It features a rather 
restricted class of patterns, which, however, are minimal with regard to their length. 

Theorem 4. Let n € N, n > 2. Ifnis even, let 

a := 1 -2 • [. . .] • n ■ (n/2 + 1) • 1 • (n/2 + 2) • 2 • [. . .] -n -n/2, 

and ifn is odd, let 

a := 1 • 1 - 2- 3 •[...] - n ■ {\n/2\ + 1) • 2 • ([n/2] +2) • 3 •[...]■ n • [n/2] . 

Then a is a shortest pattern with | var(a)| = n that is not a fixed point of a nontrivial morphism, and 
there exists a 1-uniform morphism o : N* — > {a,b}* that is unambiguous with respect to a. 

The following examples illustrates Theorem [4] and its proof: For n := 6, a := 1-2-3-4-5-6- 
4 • 1 • 5 • 2 • 6 • 3, and the 1-uniform morphism a : N* -> {a,b}* with a(l) := a(2) := a(3) := a and 
(7(4) := (7(5) := (7(6) := b is unambiguous with respect to a. For n:=5, a:=l-l-2-3-4-5-4-2-5-3, 
and the respective unambiguous morphism is given by a(l) := a(2) := a(3) := a and a(4) := a(5) := b. 

From Theorem |4] we can conclude that patterns a with unambiguous 1-uniform morphisms using a 
binary target alphabet exist for every cardinality of var(a) and that corresponding examples can be given 
where every variable occurs just twice. 
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4 Variable Target Alphabets 

In order to continue our examination of Problem [T] we now relax one of the requirements of Section [3j 
We no longer investigate criteria on the existence of unambiguous 1 -uniform morphisms for a fixed target 
alphabet £, but we permit £ to depend on the number of variables in the pattern a in question. Regarding 
this question, we conjecture the following statement to be true: 

Conjecture 1. Let a be a pattern with | var(a) | > 4. There exists an alphabet £ satisfying |£| < | var(a) | 
and a 1 -uniform morphism O : N* — > £* that is unambiguous with respect to a if and only if a is not a 
fixed point of a nontrivial morphism. 

This conjecture would be trivially true if we allowed £ to satisfy |E| > [ var(a)|. That explains why we 
exclusively study the case where the number of letters in the target alphabet is smaller than the number 
of variables in the pattern. From Theorem 12 it directly follows that an analogous conjecture would 
not be true if we considered fixed binary target alphabets (as is done in Section [3]), since none of the 
patterns a m is a fixed point of a nontrivial morphism - this can be easily verified using tools discussed 
by Reidenbach and Schneider ifTTl and Holub |7]. Hence, characteristic criteria must necessarily look 
different in such a context. It can also be effortlessly understood that Conjecture Q] would be incorrect 
if we dropped the condition that a needs to contain at least 4 distinct variables, since not only Co, but 
all 1 -uniform morphisms a : N* — > £* with |E| < 2 are ambiguous with respect to our example pattern 
0£o = l - 2- 31-3 -2 discussed in SectionQ] 

Technically, many of our subsequent technical considerations are based on the following generic 
morphisms: 

Definition 1. Let E be an infinite alphabet, and let o : N* — > E* be a renaming. For any i,j £ N with 
i ^ j and for every iGN, let the morphism Gu be given by 



Thus, Ojj maps exactly two variables to the same image, and therefore, for any pattern a with at least 
two different variables, Oij(cc) is a word over | var(a)| — 1 distinct letters. Using this definition, we can 
now state a more specific version of Conjecture CD 

Conjecture 2. Let a be a pattern with |var(a)| > 4. There exist i,j € var(a), i / j, such that is 
unambiguous with respect to OC if and only if a is not a fixed point of a nontrivial morphism. 

As a side note, we consider it worth mentioning that Conjecture [2] shows connections to another 
conjecture from the literature. In order to state the latter, we define, for any i € N, the morphism 5, : 



Conjecture 3 (Billaud CQ, Leve and Richomme [8]). Let a be a pattern with |var(a)| > 3. If, for 
every i € var(a), 5,-(a) is a fixed point of a nontrivial morphism, then a is a fixed point of a nontrivial 
morphism. 

In general, the correctness of Conjecture [3] has not been established yet. The problem is intensively 
studied by Leve and Richomme [8], where it is shown to be correct for certain subclasses of N*. 

Due to Theorem [Q the only if directions of Conjectures Q] and [2] hold true immediately. In the 
remainder of this section, we shall therefore exclusively study those patterns that are not fixed points. Our 
corresponding results yield large classes of such patterns that have an unambiguous 1 -uniform morphism, 
but we have to leave the overall correctness of our conjectures open. 




N* -> N* by 8i(i) := e and, for every j G N \ {/}, := j. 
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Conjecture [2] suggests that the examination of the existence of unambiguous 1-uniform moronisms 
for a pattern a may be reduced to finding suitable variables i and j such that a, j is unambiguous with 
respect to a. In this regard, one particular choice can be ruled out immediately: 

Proposition 1. Let a be a pattern, and let i,j G var(ot), i ^ j. If Ofj(ot) is a fixed point of a nontrivial 
morphism, then 0,j is ambiguous with respect to a. 

For example, if we consider the pattern 05i := 1-2-3-4-1-4-3-2 (which is not a fixed point) and define 
£ := {a,b,c}, then 02,4(051) equals abcbabcb (or any renaming thereof), which is a fixed point of the 
morphism given by 0(a) := abcb and <j>(b) := 0(c) := e. Thus, 02,4 is ambiguous with respect to 
tt\. However, Proposition Q] does not provide a characteristic condition on the ambiguity of O/j, since 
02,3(051) = abbcacbb is not a fixed point, but still 02,3 is ambiguous with respect to G5i. Furthermore, 
while the ambiguity of 02,3 results from the fact that G5i contains the factors 2 • 3 and 3 • 2, and is therefore 
easy to comprehend, there are more difficult examples of morphisms o, j that are ambiguous although 
they do not lead to a morphic image that is a fixed point. This is illustrated by the example 0C2 := 
1-2-3-3-4-4-1-2-3-3-4-4-2. Here, 02,4(0:1 ) = abccbbabccbbb again is not a fixed point, but 02,4 
is nevertheless ambiguous with respect to 052, since the morphism t given by t(1) := abccb, t(2) := 
b and t(3) := t(4) := £ satisfies t((Xq) = 02,4(052). We therefore conclude that it seems not to be a 
straightforward task to find amendments that could turn Proposition Q] into a characteristic condition. 

We now show that Conjecture [2] is correct for several types of patterns. To this end, we need the 
following simple sufficient condition on a pattern being a fixed point: 

Lemma 1. Let a G N + . If there exists a variable i G var(a) such that 

1. £ G" Li and, for every k £ L,-, = {/}, or 

2. e R{ and, for every k £ Rj, L^ = {/}, 

then a is a fixed point of a nontrivial morphism. 

Using this lemma, we can now establish a class of patterns for which Conjecture [2] holds true. All 
variables in these patterns have the same number of occurrences and satisfy some additional conditions: 

Theorem 5. Let m G N, m > 1. Let a G N + be a pattern that is not a fixed point of a nontrivial morphism 
and satisfies, for every x G var(a), \a\ x = m. If there are i,j G var(a), i 7^ j, such that 

• there is no k G var(a) with {i,j} C L^ or {i,j} C R^, and 

• a ^ oci -i- j- a% ■ j i- a$, oil, ax, 053 G W, 
then o,- j is unambiguous with respect to a. 

Proof. Assume to the contrary that o,,y is ambiguous. So, there exists a morphism z : N + — > L* satisfying 
t(oc) = Ofj(a) and, for some x G var(a), t(x) ^ o,j(jc). Since o,j is a 1-uniform morphism, there 
exists a k G var(a) with |t(&)| > 2. Let uv C t(^), m,v G E. Due to the fact that k occurs m times in a, 
o,-j(a) = t(oj) = wi • wv • W2 ■ uv •[...]• w TO • uv ■ w m+ \ with, for every q, 1 < q < m + 1, w q G £*. We now 
consider the following cases: 

• o,-j(j) ^ m and o,- ;(?) 7^ v. This implies that there exist the variables xi,x 2 G var(a), xi,X2 7^ / 
and xi,JC2 7^ j, such that a = a\ -x\X2 • 0:2 -xiX2 •[•••]• o; m -xiX2 ■ 05 m+ i, for every a, \ < q <m + \, 
a q G N*, and o,-j(xi) = u and o e -j(x2) = v. Due to |a| Xl = \oc\ X2 = m, the variables xi,X2 satisfy, 
for every q with 1 < q < m + 1, Xi,X2 % cc q . This implies that R Xl = {x 2 } and L X2 = {xi}. Then, 
according to Lemma[U a is a fixed point of a nontrivial morphism, which is a contradiction to the 
assumption of the theorem. 



164 



Unambiguous 1-Uniform Morphisms 



• Ojj-(i) = Gij(j) = u, and u^v. So, we assume that a = (X\ ■ x\x' ■ CC2 ■ xjx' •[...]• cc m ■ x m x' ■ a m+ \ 
with, x' G var(a) and, for every q, 1 <q<m+\,x q E var(a), a q G N*, and Oij(x q ) = u and 
Oij(x') = v. Additionally, since <7,-j(x') = v and « / v, we can conclude that x' 7^ i and x' 7^ j. We 
now consider the following cases: 

1. For every q, 1 < q < m, x q = i. This implies, using the same reasoning as above, that a is a 
fixed point of a nontrivial morphism which is a contradiction. 

2. There exists q,q', 1 < q,q' < m and q 7^ q' , such that x q = i and x q i = j. This means that 
{i,j} C L^, which contradicts the first condition of the theorem. 

• Oij(i) = v, and u 7^ v. The reasoning is analogous to that in the previous case. 

• Gij(i) = Oij(j) = u and v = u. Hence, we may assume that a = (X\ ■ x\x\ ■ (X2 ■ X2x' 2 ■[...]■ a m - 
x m x' m ■ a m+ i with, for every q, 1 < q < m+ 1, a q G N*, x q ,x' q G var(a) and Oij(x q ) = Oij(x' q ) = u. 
Due to the conditions of the theorem, the factors i • i ■ j, i- j ■ j, j i- i and j ■ j ■ i cannot be factors of 
a. Moreover, it must be noticed that u-u-u% t(k); otherwise, since x(a) = Oij(a), then \a\i > m 
or 0Cj > m. This implies that i- j ■ i and j -i- j are not factors of a. We now consider the following 
cases: 

1. For every q, 1 < q < m, x q = i and x' q = j. As a result, Rj = {j} and Lj = {/}. According to 
Lemma[T] a is a fixed point of a nontrivial morphism. 

2. For every q, 1 < q <m, x q = j and x' q = i. Thus, Rj = {/} and L, = {j}, which, due to 
Lemma [U again implies that a is a fixed point of a nontrivial morphism. 

3. There exists a q, q', 1 < q, q' < ni and q ^ q' , such that x q - x' q = i- j and x q i ■ x' q , = j ■ i. This 
case contradicts the second condition of the theorem. 

4. There exists a q,q', 1 < q,q' < m and q ^ q 1 , such that x q -x' q = i ■ j and, x q > ■ x' q , = i-i or 

-x' q , = j ■ j. This means that {i,j} C Rj or {i,j} C Lj, which is a contradiction to the first 
condition of the theorem. 

5. There exists a q,q', I < q,q' < m and g ^ q', such that x q -x' q = j ■ i and, • = i ■ i or 
x^/ -x' q i = j ■ j. This implies that {i, j} C L, or {/, j} C Rj, which contradicts the first condition 
of the theorem. 

6. There exist q,q', 1 < q,q' < m, q' 7^ q, such that x q -x' q = i ■ i and -x^, = j ■ j. Since 
mm C t(^) and due to the conditions of the theorem, it follows from T(a) = Gu(a) that 

/ i and k 7^ 7. In other words, t(i') 7^ mm and x(j) 7^ mm; otherwise, |r(a)| M > |a,-j(a)|„. 
Moreover, it must be noticed that if Gij(k) C t(^), then this implies that there exists x G 
var(a) \ with {/, j} C L v or C which is a contradiction. Thus, <Jij(k) ^ t(&). 
Since r(a) = a, j(a), there must be a fe' G var(a), fc' 7^ fc, 1,7, such that Gij(k) C t(^'), which 
means that |t(/c') | > 2, or we can extend the reasoning to other variables. Consequently, since 
t(ch) = cr(cu), this discussion implies the existence of a k" G var(a), k" 7^ k, i,j, such that 
|t(&")| > 2, which, according to the above cases, leads to a contradiction. 

□ 

We wish to point out that Theorem [5] does not only demonstrate the correctness of Conjecture [2] for the 
given class of patterns, but additionally provides an efficient way of finding an unambiguous morphism 
Gu. For example, we can immediately conclude from it that (7^4 is unambiguous with respect to our 
above example pattern a,\ . Furthermore, the theorem also holds for patterns with less than four different 
variables. 
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We now consider those patterns that are not a fixed point and, moreover, contain all of their variables 
exactly twice (note that some of these "shortest" patterns that are not fixed points are also studied in 
Theorem IU). We wish to demonstrate that Theorem [5] implies the existence of an unambiguous Ofj for 
every such pattern. This insight is based on the following lemma: 

Lemma 2. Let a G N + be a pattern with | var(a) | > 6 and, for every x G var(cu), \cc\ x = 2. Then there 
exist i,j G var(ot), i ^ j, such that 

• there is no k G var(a) with {i,j} C or {i,j} Q Rh and 

• a ^ ai i j • a 2 j i - a?, ai,a2,«3 e N*. 

Proof. Let n := |var(a)|. Since every variable occurs exactly twice in a, it directly follows that, for 
every x G var(ce), \R X \ < 2 and \L X \ < 2. By omitting the neighbourhood sets containing e, we have at 
most 2n — 2 sets of size 2. Besides, it can be verified with little effort that a contains at most n — 1 
different factors i • /, i, j G var(a), i ^ j, such that j ■ i C a (e. g., for n := 4, a := 1 • 2 • 3 • 4 • 4 • 3 • 2 • 1 has 
3 different factors i ■ j, i,j G var(ce), i / j, satisfying j ■ i C a). Assume to the contrary that, for every 
i,j G var(ce), one of the following cases is satisfied: 

• there exists a k G var(ce) with C or C R^, or 

• a = ai i j • a 2 j i - «3, «i,a2,«3 G N*. 

As mentioned above, the maximum number of pairs that are covered by the first case is 2n — 2, and for the 
second case it is n — 1. On the other hand, since | var(ce)| = n, there exist (JJ) different pairs of variables. 
However, for n > 6, we have > (2n — 2) + (n — 1), which contradicts the assumption. □ 

Hence, whenever a pattern a is not a fixed point, the conditions of Theorem [5] are automatically 
satisfied if a contains at least seven distinct variables and all of its variables occur exactly twice. Using 
a less elegant reasoning than the one on Lemma 12 we can extend this insight to all such patterns over at 
least four distinct variables. This yields the following result: 

Theorem 6. Let a G N + be a pattern with \ var(a)| > 3 and, for every x G var(a), \a\ x = 2. If a is not 
a fixed point of a nontrivial morphism, then there exist i,j G var(a), i 7^ j, such that Gj ; is unambiguous 
with respect to a. 

Theorem [6] does not only directly prove the correctness of Conjecture [2] for all patterns that contain 
all their variables exactly twice, but it also allows a large set of patterns to be constructed for which the 
Conjecture holds true as well. This construction is specified as follows: 

Theorem 7. Let a := ai • j3 • GC2 and J := GC\ ■ 0C2 be patterns with ai,Gt2,p G N*, such that 

• 7 and p are not a fixed point of a nontrivial morphism, 

• I var(y)| > 3 and,for every x G var(y), \y\ x = 2, or \ var(j8)| > 3 and, for every x G var(/3), |J3|* = 2, 
and 

• var(y) n var(/3) = 0. 

Then there exist i,j G var(a), i 7^ j, such that (T; j is unambiguous with respect to (X. 

In the remainder of this section, we shall not directly address the morphism a, >j any longer. Hence, 
we focus on Conjecture [TJ and we use an approach that differs quite significantly from those above: We 
consider words that cannot be morphic images of a pattern under any ambiguous 1 -uniform morphism, 
and we construct suitable morphic preimages from these words. This method yields another major set of 
patterns for which Conjecture Q] is satisfied. 

Our corresponding technique is based on the well-known concept of de Bruijn sequences. Since de 
Bruijn sequences are cyclic, which does not fit with our subject, we introduce a non-cyclic valiant: 
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Definition 2. A non-cyclic De Bruijn sequence (of order n) is a word over a given alphabet £ (of size k) 
for which all possible words of length n in £* appear exactly once as factors of this sequence. We denote 
the set of all non-cyclic De Bruijn sequences of order n by B'(k,n). 

For example, the word wq '■= aabacbbcca is a non-cyclic de Bruijn sequence in Z?'(3,2) if we assume 
£ := {a,b,c}. 

It can now be easily understood that a non-cyclic de Bruijn sequence cannot be a morphic image of 
any pattern under ambiguous 1 -uniform morphisms: 

Theorem 8. Let £ be an alphabet, and let a G N + be a pattern satisfying, for every x G var(a), \a\ x > 2. 
Let a : N* — > £* be a 1-uniform morphism such that, for every u\Ui C o((X), U[,U2 G £ the factor u\U2 
occurs in o((X) exactly once. Then a is unambiguous with respect to a. 

This insight implies that every pattern that can be mapped by a 1-uniform morphism to a de Bruijn 
sequence necessarily is not a fixed point, and thus, fits with Conjecture [TJ 

Corollary 1. Let £ be an alphabet, and let a G N + be a pattern satisfying, for every x G var(a), \a\ x > 2. 
Let o : N* — > £* be a 1-uniform morphism such that, for every u\Ui C o((X), u\,U2 G £, the factor u\U2 
occurs in c((X) exactly once. Then a is not a fixed point of a nontrivial morphism. 

We now show how we can construct patterns that fit with the requirements of Theorem [8] and Corol- 
lary QJ 

Definition 3. Let £ := {ai,U2,. ■ . Let B'(k,2) be the set of non-cyclic de Bruijn sequences of order 
2 over £ Then YloB(k) Q N* is the set of all patterns that can be constructed as follows: For every 
w G B'(k,2) and every letter aj in w, all nj occurrences of aj are replaced by \rij/2\ different variables 
from a set Nj := {xj { ,xj 2 , . . . ,xj^ /2J }CN, such that the following conditions are satisfied: 

• for every x £ Nj, \a\ x > 1, 

• for all i, i', 1 < i, i' < k, with i / i', N, fl N? = 0, and 

• for all i, 1 <i <k, the variables in Ni are assigned to occurrences of a, in a way such that the 
resulting pattern is in canonical form. 

For instance, with regard to our above example word wq = aabacbbcca GB'(3,2), Definition [3] says that, 
e. g., the pattern l-l-2-3-4-2-2-4-4-3is contained in 11^(3). 

From this construction, it follows that Conjecture Q] holds true for every pattern in YlDB{k): 

Theorem 9. Let £ := {a\,ci2,. . k > 3. Then, for every a ^Ylmik), 

• var(ot) contains at least k+\ elements, and 

• there exists a 1-uniform morphism o : N* — > £* that is unambiguous with respect to a. 

Proof. We begin this proof with the first statement of the theorem: It is obvious that there are k 2 different 
words of length 2 over £. The shortest word that contains k 2 factors of length 2 has length k 2 + 1, which 
means that this is the length of any word w G B'(k,2). Thus, there must be at least one letter in w that 
has at least \(k 2 + l)/k\ occurrences. Since we assume k > 3, this means that this letter has at least 
4 occurrences. From Definition [3] it then follows that this letter is replaced by at least two different 
variables when a pattern a G TloBik) is generated from w. Since all other letters in w must be replaced 
by at least one variable, this shows that | var(a) | > k + 1. 

Concerning the second statement, we define a by, for every j, 1 < j <k, and for every x G Nj, 
a(x) := aj. Thus, a is 1-uniform, and a(a) G B'(k,2). This implies that, for every u\U2 E 
u\,U2 G £, the factor u\U2 occurs in a (a) exactly once. Consequently, according to Theorem [U a is 
unambiguous with respect to a. □ 



H. Nevisi & D. Reidenbach 



167 



We conclude this paper with a statement on the cardinality of IIofl(&), demonstrating that the use of 
de Bruijn sequences indeed leads to a rich class of patterns a with unambiguous 1 -uniform morphisms, 
and that these morphisms, in general, can even have a target alphabet of size much less than var(a) — 1 
(as featured by Theorem [9]>: 

Theorem 10. Let k G N. Then \U DB (k)\ > k\^ k ~ x \ and, for every a G U DB (k), 

| var(a)| = (k - l)[k/2\ + [(k+ 1)/2J . 
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